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Abstract. In the present paper we consider the fluctuations of the free energy in the random 
energy model (REM) on a moderate deviation scale. We find that for high temperatures 
the normal approximation holds only in a narrow range of scalings away from the CLT. For 
scalings of higher order, probabilities of moderate deviations decay faster than exponentially. 



1. Introduction 

The random energy model (REM for short) is a disordered spin system from statistical 
mechanics, invented by Derrida in 1980 [HIS]. It is a toy model to describe a system of 
particles that can assume one of the 2''^ accessible states from the set Sn = {—1, +1}^, called 
the configuration space. The energy of a state a G S]\i is given by H{a) = —y/NX^^ where X^^ 
is a A/'(0, l)-distributed random variable, and the energies of different states are assumed to 
be independent, that is, {H(a))aeSN fixed N) a sequence of i. i. d. normal distributed 

random variables. Despite its far-reaching simplifications, the REM is an important model 
from statistical mechanics and has been intensively studied over the last decades. More recent 
expositions of the model can be found in the books [HHS]. 

In the following, let {0,,J-,P) be the probability space on which the triangular array of 
independent AA(0, l)-distributed random variables (Xo- : a G Sn,N G N) is defined. The 
probability of observing a configuration a G Sn of the N particle system is given by the 
random Gibbs measure 

where /3 > is the inverse temperature and Z]\f{(3) a random normalization given by 

= E ^'^''^ 

which is called partition function. Obviously, the minus sign in the definition of the random 
Hamiltonian H{a) and the minus sign in the definition of the Gibbs measure cancel each 
other, however, it is convention to use them. 

In statistical mechanics, one is interested in the existence of the so-called free energy 

Fn{P) ■■= ^logZ^(/3) 

in the limit — t- oo in an appropriate sense. Note that this definition of the free energy 
differs from the one used by physicists by the factor — /3~^, which is constant and, therefore, 
omitted by mathematicians. A complete result on the existence of the free energy in the sense 



Date: December 27, 2012. 

2010 Mathematics Subject Classification. 60F10, 82B44. 

Key words and phrases. Random Energy Model, moderate deviations, large deviations. 

1 



2 



R. MEINERS AND A. REICHENBACHS 



of almost sure convergence and convergence in LP was proved by Olivieri and Picco in 1984 
pTj and reads as follows: 

Theorem 1.1 ([ll])- Let = \/21og2. For all P > 

(§l + & ifB<0 
lim F^(/3) = F(/3) := ^ + 2 ^fP- Pc 
N^cc 1^/3/3^ if 13 > /3c 

P-almost surely and in L^^^l, F, P) for any 1 <p < oo. 

The convergence in implies that the quenched free energy Ei^Ar(/3) also converges to -F(/3) 
and, consequently, 

lim |F^(/3)-EFjv(/3)| = 

N^oo 

holds P-almost surely, which is why the free energy of the REM is said to be a self-averaging 
quantity. Moreover, the annealed free energy is given by 

1 (3^ /32 

and, therefore, the quenched free energy and annealed free energy coincide in the limit N ^ oo 
(3 < Pc- This breaks down for /3 > f3c, where the quenched free energy is strictly less than 
the annealed free energy. 

Even more, one already obtained a precise picture of free energy's deviations and fluc- 
tuations. In view of Theorem II. H it is a natural first step to ask for refinements of this 
limit theorem on the level of large deviations and, therefore, we shall briefly recall what a 
large deviation principle (LDP) is. For a thorough introduction to the field we refer to the 
books [3l[8]. Let {X,Bx) be a measurable space, consisting of a Hausdorff topological space 
X endowed with the Borel a-field Bx- In addition to that, let 7^ — )• oo be a sequence of 
real numbers and / : — ?■ [0, 00] be a lower semicontinuous function. A sequence of random 
variables {Xn)n&] defined on some probability space (S", ^, P) with values in {X^Bx) is said 
to satisfy the large deviation principle (LDP for short) with speed jn and rate function I if 

- inf I{x) < lim inf — log P (X„ G A) < lim sup — log P (X„ G yl) < - inf /(x) 

for all A G Bx- The rate function / is said to be good if the level sets {x £ X : I{x) < c} are 
compact subsets of X for all c G M. 

As already hinted at, the probabilities of C'(l)-deviations from the limiting free energy 
F{I3) have already been quantified. In [S], Fedrigo, Flandoli and Morandin proved a large 
deviation theorem for the free energy, which is stated next: 

Theorem 1.2 (LDP, [9j). The sequence of random variables (-^ log Zjv(/3))jvgN satisfies the 
LDP with speed N and good rate function L given by 

'00 ifx<F{l3) 
L{x) = J ifx = F{f3) 

^^-log2 ifx>F{/3), 

where F{f3) are the limit points of the free energy defined in ([1]). 

Note that large deviation techniques can also be used to prove ([T]) P-a. s. via Varadhan's 
Lemma (cf. [6]). 
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While Theorem [L2] describes the atypical behavior of Fj\f{/3) by studying the probabilities 
of large deviations, the typical behavior is described by theorems on its fluctuations, i. e. 
by theorems on distributional convergence of the properly rescaled free energy. This has 
been done by Bovier, Kurkova and Lowe in [2]. They proved the existence of multiple phase 
transitions on the level of distributional convergence and found that the fluctuations of the free 
energy are exponentially small. What is more, they are Gaussian if and only if /3 < y^log 2/2: 

Theorem 1.3 (CLT, [2]). 
(i) For P < Y^log2/2 



e 



(ii) For 13 = y^log 2/2 



ef('°s2-/32)log( 



^^'^ ^ ''^AA(0,l/2) 



I3,N 



Remark 1.1. Since it will be of some importance for the present paper, we quickly want 
to sketch the course of action followed in [2]: Using the Taylor expansion log(l + x) = 
X + o{x) for X — )• the authors defer the proof of a limit theorem for 



to the more manageable random variable 

f (log 2-/32) ^WV^j^^ _ ^ V YMia^ (2) 

where Yfyf{a) = {^^P^^" — g^/^^/S'jy'gA^/?^ ^^^^^ each N) i. i. d. random variables with mean 
zero and variance s\ = 1 — e~^^ — )• 1 as iV — )• oo. Next, the authors show that Y]^{a) 
satisfies Lindeberg's condition if /3 < Y^log2/2, and obtain Theorem 11.31 (i) by means of the 
CLT for triangular arrays. However, for /3 = y^log 2/2 Yi^{a) does not satisfy Lindeberg's 
condition, which is related to the fact that Yp^^aYs tails become too heavy, and the behaviour 
of the sum ^^Yi^{a) is dominated by extremal events. Yet, the authors can still prove 
convergence in distribution to a normal distribution and attain Theorem [T3] (ii). It is worth 
noting that the authors also acquire complete results for j3 > y^log 2/2, where non-standard 
limiting distributions occur, which is due to the fact that Y]\f(a) has even more weight on its 
tails in these cases. 

In view of Theorem \l.'d\ we ask the following question: can the tail probabilities 



p(^ef(i°g2-/32)log 



13, N 



> t 



be approximated by the tails of a normal distribution even for growing t, that is, does one 
find 

P (ef (^--^^) log (^) > t^) « PmO, 1) > t^) 
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even for % — )• oo? It is well-known (use e. g. ([8])) that 

1 

lim ^logP(AA(0, 1) > xtTv) 

N—>-co tj^ 

for any x > and, thus, we ask for the validity of 

li.i-Io.p(.f<— ^'.,„,(^)>.*„) ^4 (3) 

for any x > or, more general, for the existence of the LDP with speed tf^ and Gaussian 
rate function I{x) = x^ 12 for exp (7V(log 2 - /32)/2) t^Mog(Z^,Ar/ E Z^,Ar). Using the LDP 
of Theorem Ol we see that ^ does not hold if is of order e(iV exp (iV(log2 - /3^)/2)). 
Large deviation results for the remaining cases of scalings between those of the CLT and the 
LDP, i. e. 

tM ^ oo and — , , ^ , , — )• 0, 

iV exp(A^(log2 -/32)/2) 

are commonly referred to as moderate deviation results in the literature, since one asks for 
deviations of Fn{(3) of order o(l) from F{/3). In like manner, LDPs for scalings that are 
between those of the CLT and the LDP are called moderate deviation principles (MDPs). 
However, we will stick to the term LDP, since the formal definitions of the LDP and MDP 
are the same. Note that moderate deviations for mean field models from statistical mechanics 
have already been studied (cf. e.g. [T0lfl2J). 

We will show in this article that ([3]) holds if and only if tjv = o(\/iV), that is, <^ holds only 
in a small range of scalings close to the CLT scaling. This is particularly interesting since 
it is out of harmony with the general picture of moderate deviations obtained by the case 
of partial sums of standardized i. i. d. random variables (Xj)jgN- The prototypical answer for 
this case is that (tn\/^)~"'^ X^ILi satisfies under suitable conditions the LDP with speed 

and Gaussian rate function I(x) = x^/2 for the whole range of scalings between the 
corresponding CLT and LLN (see [T] for a necessary and sufficient condition on this type 
of moderate deviations). In particular, the rate function does not depend on the moderate 
deviation scaling. 

The main result of the present paper reads as follows, where — t- oo is from now on a 
diverging sequence of real numbers: 

Theorem 1.4 (Moderate deviations for the free energy in the REM). 
(i) Let p < v^log2/2. Then, 

log ' 



tN yEZ^^jv^ 

satisfies the large deviation principle. If t^ = o{\/lV), then the corresponding speed is 
tj^ and the good rate function is 



Otherwise, i/ lim inf „„).oo > 0, the LDP holds for any speed 7Ar = o{N) with the good 
rate function 

T( \ i/x = 

I oo tf X U. 
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(ii) Let /3 = v^log2/2. Then, 



log ' 



satisfies, for any scaling t]\f = o{\/log N), the LDP with speed tj^ and good rate function 
L given by 

L{x) = x^. 

Remark 1.2. 

1. Note that the restriction = o{N) is natural in view of the LDP (Theorem II ■2p : if 
one considers deviations of lower order than in the LDP, then the speed of convergence 
to zero of these probabilities is of lower order than the speed occuring in the LDP, 
which was N in our case. 

2. The degenerated rate function appearing in @ reflects the superexponential decay of 
moderate deviation probabilities in case of overscaling. 

3. Observe that for /3 = y^log 2/2 the obtained rate function is L{x) = x^, which matches 
the fact that the limiting distribution in the CLT is M{0, 1/2) and 

1 

N-^oo t\ 



jim — logP(AA(0,l/2) > xt^) = -x^ 



'-N 

for any x > 0. 

2. Proof of Theorem 11.41 

This section is devoted to the proof of Theorem 11.41 which is based on the following 
idea: As a first step, we follow the idea of the CLT's proof and use the approximation 
log(l + x) = X + o{x) for X —7- to defer the proof of the LDP for 



to the proof of the LDP for 



tjy V^^/3,Af 



To that end, we will show in Lemma 12.11 that the random variables (0) and ^ are expo- 
nentially equivalent (for a definition see e.g. Definition 4.2.10 in P]), since it is know that 
exponentially equivalent random variables satisfy the same LDP (see e.g. Theorem 4.2.13 in 
[3]). Then, we are left to prove the LDP for the random variable 



tN ^Zp^N tN2^l^ 



Te5jv 



where (YAr(cr);cr E Sn,N G N) is a triangular array of independent random variables, which 
were defined in ([2|). However, the random variable 5^(0") does not have finite exponential 
moments, which is why we use again the concept of exponential equivalence to switch over to 
the truncated random variables Y^{a), where Y^{a) :=YN{cr) l^Yi^(a)<2'^/^t~^} (^^^ Lemma 
2.2p . which can be studied by means of the Gartner-Ellis theorem (cf. e. g. Theorem 2.3.6 in 
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We prepare the proof of Theorem ()1.4p by stating and proving the above-mentioned lem- 
mata: 



Lemma 2.1. Let /3 < i/log2/2. Then, 



e 2 



(log 2-/32) 



tN 



log I — I and ' 



/3,iV 



tN 



EZ 



I3,N 



are exponentially equivalent for any speed = o{N). 



Proof. Let e > and Tg,Ar :={Zp^j^ — E jv)/IE Z^tv- Since | log(l + x) — x\ < x"^ for all 
X > —1/2, we find 



f (log 2-/32) 



e 2 



tN 



log 



Z, 



/3,Af 



EZ, 



I3,N 



gf(log2-/32) 



^/3,Ar — IE Z^^Tv 



P |log (1 + T^^n) - T^,n\ > e tN e-^ 



+ P[TlN>etN e 



tN 
(log 2-/32) 



-(log 2-/32)^ 



EZ 



/3,/V 



> £ 



< f4 + e-it]^ieT{i°s2-/32)^jEj.^ 



< efP°g2-/32)jg^2^ 



for sufficiently large, where we have made use of Markov's inequality to obtain the last but 
one line. A direct calculation yields 



ET, 



< gAf(/32-log2) 



and, therefore. 



lim sup — log P 

N^oo IN 

< lim sup — log (e 

N^oD In ^ 



gf (log 2-/32) 



tN 



log 



z 



I3,N 



EZ, 



/3,/V 



gf(log2-/32) 



^/3,Af 



EZ, 



I3,N 



EZ 



/3,Ar 



f (log 2-/32) /V(/32-log2) 



-oo. 



> e 



□ 



Lemma 2.2. Let (3 < A/log2/2 and 



assume 



tN 



o{VN) i//3< v /bi272 



Then, 



are exponentially equivalent on the scale tj^. 
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Proof. We get 



> e 



< 2^p(y^(ao)>2^/2t/ 



{y^(<x)>2^'/2t-i} 



> e 



2^P(X,„ >c;v(/3)) 



where cfq £ Sn and 



Making use of the standard estimate 
X 1 



+ 1 ^ 
which holds for ah x > 0, we get 



g-x /2 < P(AA(0, 1) > x) < - 



1 1 



X v/27r 



(8) 



Eat 



yN/2 



tiv2^/2 



< — log ( 2 

1 



creiSjv 
1 P-Civ(/3)V2 



> e 



log 2 



1 



"-TV 

T^log2- 

''TV 



e-civ(/3)V2 
C7v(/3)' log TV 



2f2 



o(l) 



+0(1) 



log2\^ logA^ 



2/3 



2/2 



+ o(l) ^ -oo 



as iV ^ oo. Note that (/3 - log 2/(2/3) )2 > if and only if /3 / y^log 2/2 so that the last hne 
follows from the conditions made on the asympotic behavior of t^. □ 

Now that we have gathered all preliminary results, we can start with a proof of this article's 
main theorem: 

Proof of Theorem \1.4\ We start with a proof of (i)'s first part and (ii). To that purpose, let 
/3 < log 2/2 and assume 



' o{VN) if /3 < Vlog2/2 
o(Vl^) if /3 = Vl^^. 
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By means of Lemma 12.11 and Lemma 12.21 it suffices to prove the desired LDP for 
This follows directly from the Gartner-Ellis theorem once we have proved 

' \2 



lim log E 
for all A G M. Since 



A(A) := <^ 



f if/3<A/^ 



log 2 
2 



t^22^1oK(l+ (E 



„AtAr2-'^/2y^(ao) 



1 



for any ao G ^at, this follows, using the Taylor expansion log(l + x) = x + O(x^) as x — )■ 0, 
from 



t^^2^(E 



,Aijv2-^/2y*(ao) 



1 = A(A) + o(l) 



(9) 



which we are going to prove in the sequel. To that purpose, we calculate the asymptotics of 
the first three moments of Y^{ao) and get 



Ey^(ao) = 0(tAr 2-^/2), 

^Y^iaof = 2A(A)+o(l), 

E|y^(ao)|3 = o(t^i2^/2). 
Ad ([inD: With cnW) (cf. ([7])) we have 

1 fCNiP) 1 



(10) 

(11) 

(12) 



-Af/3V2 



e 2 



eiV/32/2< 2iV/2 eiV/32 

(-vW/5)^d^_P(X,„<c^(/3))^ 



P(X,o >cn{(3)-VnI3) 
Using the standard estimate ^ for a Gaussian random variable, we see 

p{x^,>CNm 



0(1) 



and 



P{X„, > cn{(5) - VN/3) 
P{X„, > cn{P) - Vn/3) = o (^e-(civ(/3)-v^«V2^ 



which yields (|lUp as 

t-2^/2EyA(-o) = o(t^ie^(i-2/-'^V2),-f(^)V^^^ 

(^^)log2/(2/32Mg"f(/3-il^)^ 
= 0(1), 

where we have used in the last line that 



(13) 
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• log 2/(2/32) _ 1 = and (/3 - log2/(2/3))2 = if /3 = ^log2/2 and 

• (/3 - log 2/(2/3))2 > if /3 < ybi^. 

Ad ([HD: It is 



1 



2lT J-oo 



27r j-oo 

]_ /.yiV(log2/(2/3)-/3)+o(l) 



e + o(l) 



1 if/3<^^ 
i if/3 = y^ 



as N ^ oo. 



Ad ([T2]): For every e > it is 
t7v2-f E|y^(ao)P 



t7v2-T E 



+ tiv2-^ E 



= eEy^(ao)' + t]v'2^^(^iv(fTo) >c;v(/3) + 0(iV"'/')) 

= e E (cto) ^ + O 2" ^ 2/(2/3))2 +(l+log 2/(2/3)) log ^ 

= .EyA(ao)2 + o(t:^^^/(^^')-^e-T(^-i°s2/(2/3))2^ 

where we have used the same argument as in ()13p to derive the last line. Thus, with the help 
of pT]) we see 

hm tN2-''/^E\Y^{ao)f < e 

which yields (|12p as e was arbitrary. 

Now, we see that ([9|) follows with the help of (|10p and (jlip from 



E 



i=0 



2JV 



Since Mn 2-^/^Y^{ao) is bounded by A it can easily be seen, using the Lagrange form of the 
remainder in Taylor's formula, that 



E 



i=0 



< '-XH%2-'^/'E[\YUcro)\'] 



which finishes the proofs of (i)'s first part and (ii) with the help of (fT2]) . 
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For the the second part of (i), let = o{N) be an arbitrary speed. It suffices to prove 



Um — log P 



e 2 



(log 2-/32) 



tN 



log 



I3,N 



> e 



-oo, 



lim — log P 



e 2 



(log 2-/32) 



log 



EZ 



I3,N 



< e 



(14) 
(15) 



for any e > 0. The validity of ()14p follows directly from 



— logP 

IN 



— logP 

IN 



gf (log 2-/32) 



tN 



log 



I3,N 



gf (log 2-/32) 



/7Ar 



log 



E Z^^7v 

Zp,N 



EZ. 



13, N 



> e 



> e- 



tN 
JlN 



since (it holds liminf /^/N > and 77V = o{N) in this case) 

tN tN 



and 



lim — log P 

N^oo 77V 



/IN 



gf (log 2-/32) 




00 



/77V 



log 



z. 



I3,N 



EZ 



I3,N 



> s 



' 2 



for any 6 > hy the first part of (i), which we proved above. Finally, this also yields the 
validity of (fTC|) as dH]) implies 



lim P 

N-^oo 



f (log 2-/32) 



e 2 



tN 



log 



I3,N 



EZ, 



I3,N 



< £ 



1. 



□ 



Remark 2.1. A LDP for 



gf (log 2-/32) 



t7V 



log 



Z, 



/3,/V 



EZ, 



/3,iV 



in the case /3 = v^log2/2, hm inf n ^00 tN /\/logiV > is still an open question. By Lemma [2. II 
this random variable is exponentially equivalent to i7v^2^^/^ ^^^^^ Y/v(o") and it can even 
be shown that tJ^2~^/'^^^^g^Y^{a) satisfies the LDP with speed and rate function 
I{x) = under the natural condition t/v = o(\/iV). However, one can show that in this case 
t^^2^^/^ ^^g^^ y7v(c7) and ^^^2~^/^ ^^^^^ Y/v(ct) are not exponentially equivalent, since 
y/v(cT)'s tails become too heavy and extremal events start to dominate the sum's behavior. 
This is the same effect that can be observed in the CLT, where it engenders a breakdown of 
the standard CLT. 
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